San Francisco State University (SFSU) at Total Recall Track of TREC 2016
Abstract
This paper describes the participation of the San Francisco State University group in the Total Recall Track of the Text Retrieval Conference (TREC) 2016, organized by the National Institute of Standards and Technology (NIST). The TREC series provides large test collections and judgments that participants use to design Information Retrieval (IR) systems for different purposes. The purpose of the Total Recall Track is to find text search systems that achieve high recall while returning a minimum number of documents. This year, our team participated in all automatic tasks, including 34 topics in the athome task and 2 datasets in the sandbox task. Our system is built on the autonomous technology-assisted review (Auto TAR) model [1], which is also the baseline of the Total Recall Track. In this paper, we introduce several approaches that improve the evaluation metrics compared to the baseline model. Our enhanced model combines seed expansion with feature engineering, including adding n-grams, eliminating stop words, and preserving words that contain digits.
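The three feature-engineering steps named in the abstract (n-grams, stop-word removal, keeping tokens that contain digits) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not specify the tokenizer, the stop-word list, the n-gram order, or whether n-grams are built before or after stop-word removal. The names `STOP_WORDS`, `TOKEN_RE`, and `extract_features` are hypothetical.

```python
import re

# Hypothetical, deliberately tiny stop-word list (assumption; the paper's list is not given).
STOP_WORDS = frozenset({"a", "an", "and", "in", "is", "of", "on", "the", "to"})

# A token is any run of letters and/or digits, so "2016" and "10k" are preserved
# instead of being dropped as non-alphabetic.
TOKEN_RE = re.compile(r"[a-z0-9]+")


def extract_features(text, max_n=2):
    """Return word n-gram features (n = 1..max_n) built after stop-word removal."""
    # Lowercase, tokenize, and drop stop words (assumed order of operations).
    tokens = [t for t in TOKEN_RE.findall(text.lower()) if t not in STOP_WORDS]
    features = []
    for n in range(1, max_n + 1):
        # Slide a window of n tokens across the filtered token sequence.
        for i in range(len(tokens) - n + 1):
            features.append(" ".join(tokens[i:i + n]))
    return features


if __name__ == "__main__":
    print(extract_features("The recall of TREC 2016"))
```

Features of this kind would typically be counted or TF-IDF weighted before being passed to the classifier in a technology-assisted review loop; that weighting step is left out here because the abstract does not describe it.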
Similar resources
San Francisco State University at LiveQA Track of TREC 2016
There are many situations in our everyday life where we look for answers to questions such as “Who wrote this book?”, “How to grill a fish?” or “Where is the Opera House located?”. Twenty years ago, people answered these questions by looking them up in encyclopedias or recipe books, or by asking other people. Moving the information into the electronic form and making it universally access...
The University of Padua (IMS) at TREC 2016 Total Recall Track
The participation of the Information Management System (IMS) Group of the University of Padua in the Total Recall track at TREC 2016 consisted in a set of fully automated experiments based on the two-dimensional probabilistic model. We trained the model in two ways that tried to mimic a real user, and we compared it to two versions of the BM25 model with different parameter settings. This initi...
Todos Cuentan: Cultivating Diversity in Combinatorics.
In nine years the Combinatorics Initiative between San Francisco State University (SFSU) and the nation of Colombia has built an active community of more than two hundred mathematicians, most of whom are members of underrepresented groups in mathematics. More than fifty have pursued PhDs in mathematics, while others continue to be mathematics users, enthusiasts, and ambassadors in other fields ...
The University of Amsterdam (ILPS) at TREC 2015 Total Recall Track
We describe the participation of the University of Amsterdam's ILPS group in the Total Recall track at TREC 2015. Based on the provided Baseline Model Implementation ("BMI"), we set out to provide two more baselines we can compare to in future work. The two methods are bootstrapped by a synthetic document based on the query, use TF/IDF features, and sample with dynamic batch sizes which depend on t...
Webis at TREC 2016: Tasks, Total Recall, and Open Search Tracks
We give a brief overview of the Webis group’s participation in the TREC 2016 Tasks, Total Recall, and Open Search tracks. Our submissions to the Tasks track are similar to our last year’s system. In the task understanding subtask of the Tasks track, we use different data sources (ClueWeb12 anchor texts, AOL query log, Wikidata, etc.) and APIs (Google, Bing, etc.) to retrieve suggestions related...